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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
GAGAGATGGG GGAGGCTAAC TGAG 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
GATCCTCAGT TAGCCTCCCC CATCTCTC 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs ; 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear > 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
TATATCTCCA GATGAGGTAC ATGATTTTAG GCTTG 

i 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single / 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
TATATATCGA TTCAAGGCAT TTTCTTTTCA ?CAATAAAAC 

(2) INFORMATION FOR SEQ ID NO : 5 : / 
(i) SEQUENCE CHARACTERISTIC^ : 

(A) LENGTH: 3 7 base pa/irs 

(B) TYPE: nucleic acid! 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear / 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
TCGAGGATCC GCCCGGGCGG CCGCATCGAT GTCGACG 

(2) INFORMATION FOR SEQ ID NO:. 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CGCGTCGACA TCGATGCGGC CGCCCGGGCG GATCC 
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(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
AGTGAATTCG AGCTCGGTAC CCGGGGATCC TCTAGAGTCG ACCTGCAGGC ATGCAAGCTT 6 0 

GGCGTAATCA TGGTCAT 7 7 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
Ala Arg Glu Met Gly Glu Ala Asn 
1 5 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CCCGAGAGAT GGGGGAGGCT AACTGAG 27 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGGCTCTCTA CCCCCTCCGA TTGACACCTA G 31 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
Thr lie Met Thr Met 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCCTGTGCCT TATTTGAACT AACC 2 4 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA ( genomic ) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCCACCACAA CCACATATCC CTCC 2 4 

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CCAGTCCTCC GATTGACTG 19 

(2) INFORMATION FOR SEQ ID ?JO:15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 3 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GCGCCAGTCC TCCGATTGAC TGAGTCGCCC GGGTACCCGT GTATCCAATA AACCCTCTTG 60 

CAGTTGCATC CGACTTGTGG TCTCGCTGTT CCTTGGGAGG GTCTCCTCTG AGTGATTGAC 12 0 

TACCCGTCAG CGGGGGTCTT TCATTTGGGG GCTCGTCCGG GATCGGGAGA CCCCTGCCCA 180 

GGGACCACCG ACCCACCACC QGGAGGTAAG CTGGCCAGCA ACTTATCTGT GTCTGTCCGA 240 

TTGTCTAGTG TCTATGACTG ATTTTATGCG CCTGCGTCGG TACTAGTTAG CTAACTAGCT 3 00 

CTGTATCTGG CGGACCCGTG GTGGAACTGA CGAGTTCGGA ACACCCGGCC GCAACCCTGG 3 60 

GAGACGTCCC AGGGACTTCG GGGGCCGTTT TTGTGGCCCG ACCTGAGTCC AAAAATCCCG 42 0 

ATCGTTTTGG ACTCTTTGGT GCACCCCCCT TAGAGGAGGG ATATGTGGTT CTGGTAGGAG 480 

ACGAGAACCT AAAACAGTTC CCGCCTCCGT CTGAATTTTT GCTTTCGGTT TGGGACCGAA 540 

GCCGCGCCGC GCGTCTTGTC TGCTGCAGCA TCGTTCTGTG TTGTCTCTGT CTGACTGTGT 6 00 



TTCTGTATTT GTCTGAGAAT ATGGGCCAGA CTGTTACCAC TCCCTTAAGT TTGACCTTAG 66 0 
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GTCACTGGAA AGATGTCGAG CGGATCGCTC AJCAACCAGTC GGTAGATGTC AAGAAGAGAC 72 0 

GTTGGGTTAC CTTCTGCTCT GCAGAATGGC CAACCTTTAA CGTCGGATGG CCGCGAGACG 780 

GCACCTTTAA CCGAGACCTC ATCACCCAGG TTAAGATCAA GGTCTTTTCA CCTGGCCCGC 840 

ATGGACACCC AGACCAGGTC CCCTACATCG TGACCTGGGA AGCCTTGGCT TTTGACCCCC 9 00 

CTCCCTGGGT CAAGCCCTTT GTACACCCTA AGCCTCCGCC TCCTCTTCCT CCATCCGCCC 9 60 

CGTCTCTCCC CCTTGAACCT CCTCGTTCGA CCCCGCCTCG ATCCTCCCTT TATCCAGCCC 102 0 

TCACTCCTTC TCTAGGCGCC AAACCTAAAC CTCAAGTTCT TTCTGACAGT GGGGGGCCGC 1080 

TCATCGACCT ACTTACAGAA GACCCCCCGC CTTATAGGGA CCCAAGACCA CCCCCTTCCG 1140 

ACAGGGACGG AAATGGTGGA GAAGCGACCC CTGCGGGAGA GGCACCGGAC CCCTCCCCAA 12 00 

TGGCATCTCG CCTACGTGGG AGACGGGAGC CCCCTGTGGC CGACTCCACT ACCTCGCAGG 12 6 0 

CATTCCCCCT CCGCGCAGGA GGAAACGGAC AGCTTCAATA CTGGCCGTTC TCCTCTTCTG 13 2 0 

ACCTTTACAA CTGGAAAAAT AATAACCCTT CTTTTTCTGA AGATCCAGGT AAACTGACAG 13 8 0 

CTCTGATCGA GTCTGTTCTC ATCACCCATC AGCCCACCTG GGACGACTGT CAGCAGCTGT 14 40 

TGGGGACTCT GCTGACCGGA GAAGAAAAAC AACGGGTGCT CTTAGAGGCT AGAAAGGCGG 15 00 

TGCGGGGCGA TGATGGGCGC CCCACTCAAC TGCCCAATGA AGTCGATGCC GCTTTTCCCC 1560 

TCGAGCGCCC AGACTGGGAT TAGACCACCC AGGCAGGTAG GAACCACCTA GTCCACTATC 162 0 

GCCAGTTGCT CCTAGCGGGT CTCCAAAACG CGGGCAGAAG CCCCACCAAT TTGGCCAAGG 16 80 

TAAAAGGAAT AACACAAGGG CCJCAATGAGT CTCCCTCGGC CTTCCTAGAG AGACTTAAGG 17 40 

AAGCCTATCG CAGGTACACT CCTTATGACC CTGAGGACCC AGGGCAAGAA ACTAATGTGT 18 00 

CTATGTCTTT CATTTGGCAG TCTGCCCCAG ACATTGGGAG AAAGTTAGAG AGGTTAGAAG 18 60 

ATTTAAAAAA CAAGACGCTT GGAGATTTGG TTAGAGAGGC AGAAAAGATC TTTAATAAAC 192 0 

GAGAAACCCC GGAAGAAAGA GAGGAACGTA TCAGGAGAGA AACAGAGGAA AAAGAAGAAC 19 8 0 

GCCGTAGGAC AGAGGATGAG CAGAAAGAGA AAGAAAGAGA TCGTAGGAGA CATAGAGAGA 2 04 0 

TGAGCAAGCT ATTGGCCACT GTCGTTAGTG GACAGAAACA GGATAGACAG GGAGGAGAAC 210 0 

GAAGGAGGTC CCAACTCGAT CGCGACCAGT GTGCCTACTG CAAAGAAAAG GGGCACTGGG 2160 

CTAAAGATTG TCCCAAGAAA CCACGAGGAC CTCGGGGACC AAGACCCCAG ACCTCCCTCC 2 22 0 

TGACCCTAGA TGACTAGGGA GGTCAGGGTC AGGAGCCCCC CCCTGAACCC AGGATAACCC 22 8 0 

TCAAAGTCGG GGGGCAACCC GTCACCTTCC TGGTAGATAC TGGGGCCCAA CACTCCGTGC 2340 

TGACCCAAAA TCCTGGACCC CTAAGTGATA AGTCTGCCTG GGTCCAAGGG GCTACTGGAG 2 40 0 

GAAAGCGGTA TCGCTGGACC ACGGATCGCA AAGTACATCT AGCTACCGGT AAGGTCACCC 2 460 

ACTCTTTCCT CCATGTACCA GACTGTCCCT ATCCTCTGTT AGGAAGAGAT TTGCTGACTA 2 52 0 
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AACTAAAAGC CCAAATCCAC TTTGAGGGAT CAGGAGCTCA GGTTATGGGA CCAATGGGGC 2 5 80 

AGCCCCTGCA AGTGTTGACC CTAAATATAG AAGATGAGCA TCGGCTACAT GAGACCTCAA 1:64 0 

AAGAGCCAGA TGTTTCTCTA GGGTCCACAT GGCTGTCTGA TTTTCCTCAG GCCTGGGCGG 27 00 

AAACCGGGGG CATGGGACTG GCAGTTCGCC AAGCTCCTCT GATCATACCT CTGAAAGCAA 27 60 

CCTCTACCCC CGTGTCCATA AAACAATACC CCATGTCACA AGAAGCCAGA CTGGGGATCA 2 820 

AGCCCCACAT AC AG AG AC TG TTGGACCAGG GAATACTGGT ACCCTGCCAG TCCCCCTGGA 2 8 80 

ACACGCCCCT GCTACCCGTT AAGAAACCAG GGACTAATGA TTATAGGCCT GTCCAGGATC 2 940 

TGAGAGAAGT CAACAAGCGG GTGGAAGACA TCCACCCCAC CGTGCCCAAC CCTTACAACC 3 000 

TCTTGAGCGG GCTCCCACCG TCCCACCAGT GGTACACTGT GCTTGATTTA AAGGATGCCT 3 060 

TTTTCTGCCT GAGACTCCAC CCCACCAGTC AGCCTCTCTT CGCCTTTGAG TGGAGAGATC 312 0 

CAGAGATGGG AATCTCAGGA CAATTGACCT GGACCAGACT CCCACAGGGT TTCAAAAACA 3180 

GTCCCACCCT GTTTGATGAG GCACTGCACA GAGACCTAGC AGACTTCCGG ATCCAGCACC 32 40 

CAGACTTGAT CCTGCTACAG TACGTGGATG ACTTACTGCT GGCCGCCACT TCTGAGCTAG 3 3 00 

ACTGCCAACA AGGTACTCGG GCCCTGT^AC AAACCCTAGG GAACCTCGGG TATCGGGCCT 3 3 60 

CGGCCAAGAA AGCCCAAATT TGCCAGAAAC AGGTCAAGTA TCTGGGGTAT CTTCTAAAAG 3 42 0 

AGGGTCAGAG ATGGCTGACT GAGGCCAGAA AAGAGACTGT GATGGGGCAG CCTACTCCGA 3 4 80 

AGACCCCTCG ACAACTAAGG GAGTTCCTAG GGACGGCAGG CTTCTGTCGC CTCTGGATCC 3 540 

CTGGGTTTGC AGAAATGGCA GCCCCCTTGT ACCCTCTCAC CAAAACGGGG ACTCTGTTTA 3 6 00 

ATTGGGGCCC AGACCAACAA AAGGOCTATC AAGAAATCAA GCAAGCTCTT CTAACTGCCC 3 6 60 

CAGCCCTGGG GTTGCCAGAT TTGACTAAGC CCTTTGAACT CTTTGTCGAC GAGAAGCAGG 3 720 

GCTACGCCAA AGGTGTCCTA ACGC^AAAAC TGGGACCTTG GCGTCGGCCG GTGGCCTACC 3 7 80 

TGTCCAAAAA GCTAGACCCA GTAQCAGCTG GGTGGCCCCC TTGCCTACGG ATGGTAGCAG 3 84 0 

CCATTGCCGT ACTGACAAAG GATQCAGGCA AGCTAACCAT GGGACAGCCA CTAGTCATTC 3 900 

TGGCCCCCCA TGCAGTAGAG GCACTAGTCA AACAACCCCC CGACCGCTGG CTTTCCAACG 3 960 

CCCGGATGAC TCACTATCAG GCCpTTGCTTT TGGACACGGA CCGGGTCCAG TTCGGACCGG 4 02 0 

TGGTAGCCCT GAACCCGGCT AC|GCTGCTCC CACTGCCTGA GGAAGGGCTG CAACACAACT 4080 

GCCTTGATAT CCTGGCCGAA GCCCACGGAA CCCGACCCGA CCTAACGGAC CAGCCGCTCC 414 0 

CAGACGCCGA CCACACCTGG TACACGGATG GAAGCAGTCT CTTACAAGAG GGACAGCGTA 42 00 

AGGCGGGAGC TGCGGTGACC ACCGAGACCG AGGTAATCTG GGCTAAAGCC CTGCCAGCCG 4 2 60 

GGACATCCGC TCAGCGGGCT GAACTGATAG CACTCACCCA GGCCCTAAAG ATGGCAGAAG 4320 
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GTAAGAAGCT AAATGTTTAT ACTGATAGCC GTTATGCTTT TGCTACTGCC CATATCCATG 4 3 80 

GAGAAATATA CAGAAGGCGT GGGTTGCTCA CATCAGAAGG CAAAGAGATC AAAAATAAAG 4 44 0 

ACGAGATCTT GGCCCTACTA AAAGCCCTCT TTCTGGCCAA AAGACTTAGC ATAATCCATT 4 500 

GTCCAGGACA TCAAAAGGGA CACAGCGCCG AGGCTAGAGG CAACCGGATG GCTGACCAAG 45 6 0 

CGGCCCGAAA GGCAGCCATC ACAGAGACTC CAGACACCTC TACCCTCCTC ATAGAAAATT 4 62 0 

CATCACGCTA CACCTCAGAA CATTTTCATT ACACAGTGAC TGATATAAAG GACCTAACCA 46 8 0 

AGTTGGGGGC CATTTATGAT AAAACAAAGA AGTATTGGGT CTACCAAGGA AAACCTGTGA 47 4 0 

TGCCTGACCA GTTTACTTTT GAATTATTAG ACTTTCTTCA TCAGCTGACT CACCTCAGCT 48 0 0 

TCTCAAAAAT GAAGGCTCTC CTAGAGAGAA GCCACAGTCC CTACTACATG CTGAACCGGG 48 6 0 

ATCGAACAGT CAAAAATATC ACTGAGACCT GCAAAGCTTG TGCACAAGTC AACGCCAGCA 4 92 0 

AGTCTGCCGT TAAACAGGGA ACTAGGGTCC GCGGGCATCG GCCCGGCACT CATTGGGAGA 49 8 0 

TCGATTTCAC CGAGATAAAG CCCGGATTGT ATGGCTATAA ATATCTTCTA GTTTTTATAG 5 04 0 

ATACCTTTTC TGGCTGGATA GAAGCCTTCC CAACCAAGAA AGAAACCGCC AAGGTCGTAA 510 0 

CCAAGAAGCT ACTAGAGGAG ATCTTCCCCA GGTTCGGCAT GCCTCAGGTA TTGGGAACTG 516 0 

ACAATGGGCC TGCCTTCGTC TCCAAGGTGA GTCAGACAGT GGCCGATCTG TTGGGGATTG 522 0 

ATTGGAAATT ACATTGTGCA TACAGACCCC AAAGCTCAGG CCAGGTAGAA AGAATGAATA 52 80 

GAACCATCAA GGAGACTTTA ACTAAATTAA CGCTTGCAAC TGGCTCTAGA GACTGGGTGC 53 4 0 

TCCTACTCCC CTTAGCCCTG TACCGAGCCC GCAACACGCC GGGCCCCCAT GGCCTCACCC 54 00 

CATATGAGAT CTTATATGGG GCACCCCCGC CCCTTGTAAA CTTCCCTGAC CCTGACATGA 54 6 0 

CAAGAGTTAC TAACAGCCCC TCTCTCCAAG CTCACTTACA GGCTCTCTAC TTAGTCCAGC 5 52 0 

ACGAAGTCTG GAGACCTCTG GCGGCAGCCT ACCAAGAACA ACTGGACCGA CCGGTGGTAC 55 8 0 

CTCACCCTTA CCGAGTCGGC GACAjCAGTGT GGGTCCGCCG ACACCAGACT AAGAACCTAG 564 0 

AACCTCGCTG GAAAGGACCT TACACAGTCC TGCTGACCAC CCCCACCGCC CTCAAAGTAG 57 00 

ACGGCATCGC AGCTTGGATA CACGCCGCCC ACGTGAAGGC TGCCGACCCC GGGGGTGGAC 57 60 

CATCCTCTAG ACTGACATGG CGQGTTCAAC GCTCTCAAAA CCCCTTAAAA ATAAGGTTAA 582 0 

CCCGCGAGGC CCCCTAATCC CCjTAATTCT TCTGATGCTC AGAGGGGTCA GTACTGCTTC 5 8 80 

GCCCGGGTCC AGTCCTCATC AAGTCTATAA TATCACCTGG GAGGTAACCA ATGGAGATCG 5 94 0 

GGAGACGGTA TGGGCAACTT CTGGCAACCA CCCTCTGTGG ACCTGGTGGC CTGACCTTAC 6 000 

CCCAGATTTA TGTATGTTAG CGCACCATGG ACCATCTTAT TGGGGGCTAG AATATCAATC 6060 

CCCTTTTTCT TCTCCCCCGG GGCGCCCTTG TTGCTCAGGG GGCAGCAGCC CAGGCTGTTC 612 0 

CAGAGACTGC GAAGAACCTT TAACCTCCCT CACCCCTCGG TGCAACACTG CCTGGAACAG 6180 
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ACTCAAGCTA GACCAGACAA CTCATAAATC AAATGAGGQA TTTTATGTTT GCCCCGGGCC 62 4 0 

CCACCGCCCC CGAGAATCCA AGTCATGTGG GGGTCCAGAC TCCTTCTACT GTGCCTATTG 63 00 

GGGCTGTGAG ACAACCGGTA GAGCTTACTG GAAGCCCTCC TCATCATGGG ATTTCATCAC 63 6 0 

AGTAAACAAC AATCTCACCT CTGACCAGGC TGTCCAGGTA TGCAAAGATA ATAAGTGGTG 642 0 

CAACCCCTTA GTTATTCGGT TTACAGACGC CGGGAGACGG GTTACTTCCT GGACCACAGG 64 8 0 

ACATTACTGG GGCTTACGTT TGTATGTCTC CGGACAAGAT CCAGGGCTTA CATTTGGGAT 6 54 0 

CCGACTCAGA TACCAAAATC TAGGACCCCG CGTCCCAATA GGGCCAAACC CCGTTCTGGC 6600 

AGACCAACAG CCACTCTCCA AGCCCAAACC TGTTAAGTCG CCTTCAGTCA CCAAACCACC 6660 

CAGTGGGACT CCTCTCTCCC CTACCCAACT TCCACCGGCG GGAACGGAAA ATAGGCTGCT 672 0 

AAACTTAGTA GACGGAGCCT ACCAAGCCCT CAACCTCACC AGTCCTGACA AAACCCAAGA 67 8 0 

GTGCTGGTTG TGTCTAGTAG CGGGACCCCC CTACTACGAA GGGGTTGCCG TCCTGGGTAC 6 84 0 

CTACTCCAAC CATACCTCTG CTCCAGCCAA CTGCTCCGTG GCCTCCCAAC ACAAGTTGAC 6900 

CCTGTCCGAA GTGACCGGAC AGGGACTCTG CATAGGAGCA GTTCCCAAAA CACATCAGGC 6960 

CCTATGTAAT ACCACCCAGA CAAGCAGTCG AGGGTCCTAT TATCTAGTTG CCCCTACAGG 7 02 0 

TACCATGTGG GCTTGTAGTA CCGGGCTTAC TCCATGCATC TCCACCACCA TACTGAACCT 7080 

TACCACTGAT TATTGTGTTC TTGTCGAACT CTGGCCAAGA GTCACCTATC ATTCCCCCAG 714 0 

CTATGTTTAC GGCCTGTTTG AGAGATCCAA CCGACACAAA AG AG AAC C GG TGTCGTTAAC 7 2 00 

CCTGGCCCTA TTATTGGGTG GACTAACCAT GGGGGGAATT GCCGCTGGAA TAGGAACAGG 7 2 60 

GACTACTGCT CTAATGGCCA CTCAGCAATT CCAGCAGCTC CAAGCCGCAG TACAGGATGA 732 0 

TCTCAGGGAG GTTGAAAAAT CAATCTCTAA CCTAGAAAAG TCTCTCACTT CCCTGTCTGA 7 3 80 

AGTTGTCCTA CAGAATCGAA GGGGCCTAGA CTTGTTATTT CTAAAAGAAG GAGGGCTGTG 7 44 0 

TGCTGCTCTA AAAGAAGAAT GTTGCTTCTA TGCGGACCAC ACAGGACTAG TGAGAGACAG 7 5 00 

CATGGCCAAA TTGAGAGAGA GGCTTAATCA GAGACAGAAA CTGTTTGAGT CAACTCAAGG 7 5 60 

ATGGTTTGAG GGACTGTTTA ACAGATCGCC TTGGTTTACC ACCTTGATAT CTACCATTAT 7 62 0 

GGGACCCCTC ATTGTACTCC TAATGATTTT GCTCTTCGGA CCCTGCATTC TTAATCGATT 7 680 

AGTCCAATTT GTTAAAGACA GGATATCAGT GGTCCAGGCT CTAGTTTTGA CTCAACAATA 7740 

TCACCAGCTG AAGCCTATAG AGTACGAGCC ATAGATAAAA TAAAAGATTT TATTTAGTCT 7 800 

CCAGAAAAAG GGGGGAATGA AAGACCCCAC CTGTAGGTTT GGCAAGCTAG CTTAAGTAAC 7 860 

GCCATTTTGC AAGGCATGGA AAAATACATA ACTGAGAATA GAGAAGTTCA GATCAAGGTC 7 92 0 

AGGAACAGAT GGAACAGCTG AATATGGGCC AAACAGGATA TCTGTGGTAA GCAGTTCCTG 7 980 
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CCCCGGCTCA GGGCCAAGAA CAGATGGAAC AGCTGAATAT GGGCCAAACA GGATATCTGT 804 0 

GGTAAGCAGT TCCTGCCCCG GCTCAGGGCC AAGAACAGAT GGTCCCCAGA TGCGGTCCAG 8100 

CCCTCAGCAG TTTCTAGAGA ACCATCAGAT GTTTCCAGGG TGCCCCAAGG ACCTGAAATG 8160 

ACCCTGTGCC TTATTTGAAC TAACCAATCA GTTCGCTTCT CGCTTCTGTT CGCGCGCTTC 822 0 

TGCTCCCCGA GCTCAATAAA AGAGCCCACA ACCCCTCACT CGGGGCGCCA GTCCTCCGAT 82 8 0 

TGACTGAGTC GCCCGGGTAC CCGTGTATCC AATAAACCCT CTTGCAGTTG CA 83 3 2 

(2) INFORMATION FOR SEQ ID NO : 1 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGGAGTGGTA ACAGTCTGGC CTTAATTCTC AG 3 2 

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CGGTCGACCT CGAGAATTAA TTC 2 3 



(2) INFORMATION FOR SEQ ID NO; 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTGGGAGACG TCCCAGGGAC TTC 2 3 



(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GGCCAGACTG TTACCACTCC CTGAAGTTTG AC 3 2 



(2) INFORMATION FOR SEQ ID NO : 2 0 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CATCGATAAA ATAAAAGATT TTATTTAGTC 3 0 
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(2) INFORMATION FOR SEQ ID NO : 2 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 : 
CAAATGAAAG ACCCCCGCTG AC 2 2 



(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GAAGCTTCTC CCAGAACCCA CCAGTCTTGA AACGCCATC 3 9 



(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GTACCAGCTT TTGGTCTCAT CAAAG 2 5 



(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CTCCTCGAGC TAAAGATATT TTAGAGAAGA ATTAAC 3 6 



(2) INFORMATION FOR SEQ ID NO : 2 5 : 
(i) SEQUENCE CHARACTEI&STICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TTCCTCTGGA CAGCTGTCTA CTTTG 2 5 



(2) INFORMATION FOR SEQ ID NO : 2 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TCGAGGGGCC CAGATCTGCG GCCGCTCGCG AGTCGACAAG CTTGGATCCA T 51 
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(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CGATGGATCC AAGCTTGTCG ACTCGCGAGC GGCCGCAGAT CTGGGCCCC 4 9 



(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 8 : 
CACCGTCGTC GACTTATGCT 2 0 



(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GACCGTCGAC TCAATTCTGG GAGAAGCTTC TTGG 3 4 



(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 0 : 
CACCGTCGTC GACTTATGCT 2 0 



(2) INFORMATION FOR SEQ ID NO : 3 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 : 
CAACGCTCGA GAAGCAGAAT CGCAAAAGGC 3 0 



(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 2 : 
TCGGCTCGAG GCATCAACGG GAAATAACTC GT 3 2 
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(2) INFORMATION FOR SEQ ID NO : 3 3 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 3 : 
CCGACTCGAG TCAGTAGAGG TCCTGTGCCT C 31 




(2) INFORMATION FOR SEQ ID NO : 3 4 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 4 : 
GCGACTCGAG CATGGGGCCC TGGGGC 2 6 



(2) INFORMATION FOR SEQ ID NO : 3 5 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 5 : 
GCACTGGAAT TCGTCAGGGC G 21 

(2) INFORMATION FOR SEQ ID NO : 3 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ' ID NO : 3 6 : 
CGCGCCGCTC GAGTCTACAA TGGCCTTGAC CTTTGCTTTA CTGG 44 

(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GCGCCCATCG ATTCATTCCT TACTTCTTAA ACTTTCTTGC AAG 4 3 

(2) INFORMATION FOR SEQ ID NO : 3 8 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 8 : 
CGCGCCGCTC GAGCATCCAA TGGCCCTGTC CTTTTCTTTA CTTATGG 47 
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(2) INFORMATION FOR SEQ ID NO : 3 9 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 9 : 
CCATCGATTC AATCCTTCCT CCTTAATCTT TTTTGCAAG 3 9 

(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CCGGATCCTC TACAATGGCC TTGACCTTTG CTTTACTGG 3 9 

(2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CGCGCCGGCG GCCGCTCATT CCTTACTTCT TAAACTTTCT TGCAAG 4 6 

(2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
CCGGATCCCA TCCAATGGCC CTGTCCTTTT CTTTACTTAT GG 42 

(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CGCGCCGGCG GCCGCTCAAT CCTTCCTCCT TAATCTTTTT TGCAAG 4 6 

(2) INFORMATION FOR SEQ ID NO: 44: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9080 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GGGGGGGGGG GGGGGGGGGG GGGTGAGCAC ATCCAGTGGG TAAAGTTCCT TAAAATGCTC 6 0 

TGCAAAGAAA TTGGGACTTT TCATTAAATC AGAAATTTTA CTTTTTTCCC CTCCTGGGAG 12 0 

CTAAAGATAT TTTAGAGAAG AATTAACCTT TTGCTTCTCC AGTTGAACAT TTGTAGCAAT 180 
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AAGTCATGCA AATAGAGCTC TCCACCTGCT TCTTTCTGTG CCTTTTGCGA TTCTGCTTTA 240 

GTGCCACCAG AAGATACTAC CTGGGTGCAG TGGAACTGTC ATGGGACTAT ATGCAAAGTG 3 00 

ATCTCGGTGA GCTGCCTGTG GACGCAAGAT TTCCTCCTAG AGTGCCAAAA TCTTTTCCAT 3 60 

TCAACACCTC AGTCGTGTAC AAAAAGACTC TGTTTGTAGA ATTCACGGAT CACCTTTTCA 42 0 

ACATCGCTAA GCCAAGGCCA CCCTGGATGG GTCTGCTAGG TCCTACCATC CAGGCTGAGG 4 80 

TTTATGATAC AGTGGTCATT ACACTTAAGA ACATGGCTTC CCATCCTGTC AGTCTTCATG 54 0 

CTGTTGGTGT ATCCTACTGG AAAGCTTCTG AGGGAGCTGA ATATGATGAT CAGACCAGTC 600 

AAAGGGAGAA AGAAGATGAT AAAGTCTTCC CTGGTGGAAG CCATACATAT GTCTGGCAGG 660 

TCCTGAAAGA GAATGGTCCA ATGGCCTCTG ACCCACTGTG CCTTACCTAC TCATATCTTT 720 

CTCATGTGGA CCTGGTAAAA GACTTGAATT CAGGCCTCAT TGGAGCCCTA CTAGTATGTA 7 80 

GAGAAGGGAG TCTGGCCAAG GAAAAGACAC AGACCTTGCA CAAATTTATA CTACTTTTTG 84 0 

CTGTATTTGA TGAAGGGAAA AGTTGGCACT CAGAAACAAA GAACTCCTTG ATGCAGGATA 9 00 

GGGATGCTGC ATCTGCTCGG GCCTGGCCTA AAATGCACAC AGTCAATGGT TATGTAAACA 9 60 

GGTCTCTGCC AGGTCTGATT GGATGCCACA GGAAATGAGT CTATTGGCAT GTGATTGGAA 102 0 

TGGGCACCAC TCCTGAAGTG CACTCAATAT TCCTCGAAGG TCACACATTT CTTGTGAGGA 10 80 

ACCATCGCCA GGCGTCCTTG GAAATCTCGC CAATAACTTT CCTTACTGCT CAAACACTCT 1140 

TGATGGACCT TGGACAGTTT CTACTGTTTT GTCATATCTC TTCCCACCAA CATGATGGCA 12 00 

TGGAAGCTTA TGTCAAAGTA GACAGGTGTC CAGAGGAACC CCAACTACGA ATGAAAAATA 12 60 

ATGAAGAAGC GGAAGACTAT GATGATGATC TTACTGATTC TGAAATGGAT GTGGTCAGGT 13 2 0 

TTGATGATGA CAACTCTCCT TCCTTTATCC AAATTCGCTC AGTTGCCAAG AAGCATCCTA 13 80 

AAACTTGGGT ACATTACATT GCTGCTGAAG AGGAGGACTG GGACTATGCT CCCTTAGTCC 1440 

TCGCCCCCGA TGACAGAAGT TATAAAAGTC AATATTTGAA CAATGGCCCT CAGCGGATTG 15 00 

GTAGGAAGTA CAAAAAAGTC CGATTTATGG CATACACAGA TGAAACCTTT AAGACTCGTG 156 0 

AAGCTATTCA GCATGAATCA GGAATCTTGG GACCTTTACT TTATGGGGAA GTTGGAGACA 162 0 

CACTGTTGAT TATATTTAAG AATCAAGCAA GCAGACCATA TAACATCTAC CCTCACGGAA 16 80 

TCACTGATGT CCGTCCTTTG TATTCAAGGA GATTACCAAA AGGTGTAAAA CATTTGAAGG 17 4 0 

ATTTTCCAAT TCTGCCAGGA GAAATATTCA AATATAAATG GACAGTGACT GTAGAAGATG 18 00 

GGCCAACTAA ATCAGATCCT CGGTGCCTGA CCCGCTATTA CTCTAGTTTC GTTAATATGG 18 60 

AGAGAGATCT AGCTTCAGGA CTCATTGGCC CTCTCCTCAT CTGCTACAAA GAATCTGTAG 192 0 

ATCAAAGAGG AAACCAGATA ATGTCAGACA AGAGGAATGT CATCCTGTTT TCTGTATTTG 19 80 

ATGAGAACCG AAGCTGGTAC CTCACAGAGA ATATACAACG CTTTCTCCCC AATCCAGCTG 2 040 

GAGTGCAGCT TGAGGATCCA GAGTTCCAAG CCTCCAACAT CATGCACAGC ATCAATGGCT 2100 

ATGTTTTTGA TAGTTTGCAG TTGTCAGTTT GTTTGCATGA GGTGGCATAC TGGTACATTC 2160 

TAAGCATTGG AGCACAGACT GACTTCCTTT CTGTCTTCTT CTCTGGATAT ACCTTCAAAC 222 0 

ACAAAATGGT CTATGAAGAC ACACTCACCC TATTCCCATT CTCAGGAGAA ACTGTCTTCA 22 80 

TGTCGATGGA AAACCCAGGT CTATGGATTC TGGGGTGCCA CAACTCAGAC TTTCGGAACA 2 3 40 

GAGGCATGAC CGCCTTACTG AAGGTTTCTA GTTGTGACAA GAACACTGGT GATTATTACG 2 400 

AGGACAGTTA TGAAGATATT TCAGCATACT TGCTGAGTAA AAACAATGCC ATTGAACCAA 2 460 

GAAGCTTCTC CCAGAATTCA AGACACCCTA GCACTAGGCA AAAGCAATTT AATGCCACCA 2 52 0 

CAATTCCAGA AAATGACATA GAGAAGACTG ACCCTTGGTT TGCACACAGA ACACCTATGC 2 580 

C T AAAATAC A AAATGTCTCC TCTAGTGATT TGTTGATGCT CTTGCGACAG AGTCCTACTC 2 640 

CACATGGGCT ATCCTTATCT GATCTCCAAG AAGCCAAATA TGAGACTTTT TCTGATGATC 27 00 

CATCACCTGG AGCAATAGAC AGTAATAACA GCCTGTCTGA AATGACACAC TTCAGGCCAC 27 60 

AGCTCCATCA CAGTGGGGAC ATGGTATTTA CCCCTGAGTC AGGCCTCCAA TTAAGATTAA 2 82 0 

ATGAGAAACT GGGGACAACT GCAGCAACAG AGTTGAAGAA ACTTGATTTC AAAGTTTCTA 2 880 

GTACATCAAA TAATCTGATT TCAACAATTC CATCAGACAA TTTGGCAGCA GGTACTGATA 2 940 

ATACAAGTTC CTTAGGACCC CCAAGTATGC CAGTTCATTA TGATAGTCAA TTAGATACCA 3 000 

CTCTATTTGG CAAAAAGTCA TCTCCCCTTA CTGAGTCTGG TGGAGCTCTG AGCTTGAGTG 3 060 

AAGAAAATAA TGATTCAAAG TTGTTAGAAT CAGGTTTAAT GAATAGCCAA GAAAGTTCAT 312 0 

GGGGAAAAAA TGTATCGTCA ACAGAGAGTG GTAGGTTATT TAAAGGGAAA AGAGCTCATG 3180 

GACCTGCTTT GTTGACTAAA GATAATGCCT TATTCAAAGT TAGCATCTCT TTGTTAAAGA 3 240 

CAAACAAAAC TTCCAATAAT TCAGCAACTA ATAGAAAGAC TCACATTGAT GGCCCATCAT 3 3 00 

TATTAATTGA GAATAGTCCA TCAGTCTGGC AAAATATATT AGAAAGTGAC ACTGAGTTTA 3 3 60 

AAAAAGTGAC ACCTTTGATT CATGACAGAA TGCTTATGGA CAAAAATGCT ACAGCTTTGA 342 0 

GGCTAAATCA TATGTCAAAT AAAACTACTT CATCAAAAAA CATGGAAATG GTCCAACAGA 34 80 

AAAAAGAGGG CCCCATTCCA CCAGATGCAC AAAATCCAGA TATGTCGTTC TTTAAGATGC 3 540 

TATTCTTGCC AGAATCAGCA AGGTGGATAC AAAGGACTCA TGGAAAGAAC TCTCTGAACT 3 6 00 

CTGGGCAAGG CCCCAGTCCA AAGCAATTAG TATCCTTAGG ACCAGAAAAA TCTGTGGAAG 3 6 60 

GTCAGAATTT CTTGTCTGAG AAAAACAAAG TGGTAGTAGG AAAGGGTGAA TTTACAAAGG 372 0 

ACGTAGGACT CAAAGAGATG GTTTTTCCAA GCAGCAGAAA CCTATTTCTT ACTAACTTGG 37 80 

ATAATTTACA TGAAAATAAT ACACACAATC AAGAAAAAAA AATTCAGGAA GAAATAGAAA 3 840 
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AGAAGGAAAC ATTAATCCAA GAGAATGTAG TTTTGCCTCA GATACATACA GTGACTGGCA 3 900 

CTAAGAATTT CATGAAGAAC GTTTTCTTAC TGAGCACTAG GCAAAATGTA GAAGGTTCAT 3 960 

ATGACGGGGC ATATGCTCCA GTACTTCAAG ATTTTAGGTC ATTAAATGAT TCAACAAATA 4 02 0 

GAACAAAGAA ACACACAGCT CATTTCTCAA AAAAAGGGGA GGAAGAAAAC TTGGAAGGCT 408 0 

TGGGAAATCA AACCAAGCAA ATTGTAGAGA AATATGCATG CACCACAAGG ATATCTCCTA 4140 

ATACAAGCCA GGAGAATTTT GTCACGCAAC GTAGTAAGAG AGCTTTGAAA CAATTCAGAC 42 0 0 

TCCCACTAGA AGAAACAGAA CTTGAAAAAA GGATAATTGT GGATGACACC TCAACCCAGT 42 6 0 

GGTCCAAAAA CATGAAACAT TTGACCCCGA GGACCCTCAC ACAGATAGAC TACAATGAGA 43 2 0 

AGGAGAAAGG GGCCATTACT CAGTCTCCCT TATCAGATTG CCTTACGAGG AGTCATAGCA 43 80 

TCCCTCAAGC AAATAGATCT CCATTACCCA TTGCAAAGGT ATCATCATTT CCATCTATTA 4 44 0 

GACCTATATA TCTGACCAGG GTCCTATTCC AAGACAACTC TTCTCATCTT CCAGCAGCAT 4 500 

CTTATAGAAA GAAAGATTCT GGGGTCGAAG AAAGCAGTCA TTTCTTACAA GGAGCCAAAA 4 560 

AAAATAACCT TTCTTTAGCC ATTCTAACCT TGGAGATGAC TGGTGATCAA AGAGAGGTTG 462 0 

GCTCCCTGGG GACAAGTGCC ACAAATTCAG TCACATACAA GAAAGTTGAG AACACTGTTC 4680 

TCCCGAAACC AGACTTGCCC AAAACATCTG GCAAAGfTGA ATTGCTTCCA AAAGTTCACA 47 40 

TTTATCAGAA GGACCTATTC CCTACGGAAA CTAGCAATGG GTCTCCTGGC CATCTGGATC 4800 

TCGTGGAAGG GAGCCTTCTT CAGGGAACAG AGGGAGCGAT TAAGTGGAAT GAAGCAAACA 48 60 

GACCTGGAAA AGTTCCCTTT CTGAGAGTAG CAACAG&AAG CTCTGCAAAG ACTCCCTCCA 4 92 0 

AGCTATTGGA TCCTCTTGCT TGGGATAACC ACTATQGTAC TCAGATACCA AAAGAAGAGT 49 80 

GGAAATCCCA AGAGAAGTCA CCAGAAAAAA CAGCTT?TTAA GAAAAAGGAT ACCATTTTGT 5 04 0 

CCCTGAACGC TTGTGAAAGC AATCATGCAA TAGCA^CAAT AAATGAGGGA CAAAATAAGC 510 0 

CCGAAATAGA AGTCACCTGG GCAAAGCAAG GTAGG^CTGA AAGGCTGTGC TCTCAAAACC 5160 

CACCAGTCTT GAAACGCCAT CAACGGGAAA TAACTCGTAC TACTCTTCAG TCAGATCAAG 52 2 0 

AGGAAATTGA CTATGATGAT ACCATATCAG TTGA^LATGAA GAAGGAAGAT TTTGACATTT 52 8 0 

ATGATGAGGA TGAAAATCAG AGCCCCCGCA GGTT^CAAAA GAAAACACGA CACTATTTTA 53 40 

TTGCTGCAGT GGAGAGGCTC TGGGATTATG GGATGAGTAG CTGCCCACAT GTTCTAAGAA 5400 

ACAGGGCTCA GAGTGGCAGT GTCCCTCAGT TCA^GAAAGT TGTTTTCCAG GAATTTACTG 5460 

ATGGCTCCTT TACTCAGCCC TTATACCGTG GAGAACTAAA TGAACATTTG GGACTCCTGG 5 52 0 

GGCCATATAT AAGAGCAGAA GTTGAAGATA ATATCATGGT AACTTTCAGA AATCAGGCCT 5580 

CTCGTCCCTA TTCCTTCTAT TCTAGCCTTA TTTCTTATGA GGAAGATCAG AGGCAAGGAG 5640 

CAGAACCTAG AAAAAACTTT GTCAAGCCTA ATQAAACCAA AACTTACTTT TGGAAAGTGC 57 00 

AACATCATAT GGCACCCACT AAAGATGAGT TTGACTGCAA AGCCTGGGCT TATTTCTCTG 57 60 

ATGTTGACCT GGAAAAAGAT GTGCACTCAG GCCTGATTGG ACCCCTTCTG GTCTGCCACA 5 82 0 

CTAACACACT GAACCCTGCT CATGGGAGAC AAGTGACAGT ACAGGAATTT GCTCTGTTTT 5 8 80 

TCACCATCTT TGATGAGACC AAAAGCTGGT ACTTCACTGA AAATATGGAA AGAAACTGCA 59 4 0 

GGGCTCCCTG CAATATCCAG ATGGAAGATC CCACTTTTAA AGAGAATTAT CGCTTCCATG 6 00 0 

CAATCAATGG CTACATAATG GATACACTAC QTGGCTTAGT AATGGCTCAG GATCAAAGGA 6 06 0 

TTCGATGGTA TCTGCTCAGC ATGGGCAGCA ATGAAAACAT CCATTCTATT CATTTCAGTG 612 0 

GACATGTGTT CACTGTACGA AAAAAAGAGG AGTATAAAAT GGCACTGTAC AATCTCTATC 6180 

CAGGTGTTTT TGAGACAGTG GAAATGTTAC CATCCAAAGC TGGAATTTGG CGGGTGGAAT 62 4 0 

GCCTTATTGG CGAGCATCTA CATGCTGGGA TGAGCACACT TTTTCTGGTG TACAGCAATA 6 3 00 

AGTGTCAGAC TCCCCTGGGA ATGGCTTCTG /GACACATTAG AGATTTTCAG ATTACAGCTT 63 6 0 

CAGGACAATA TGGACAGTGG GCCCCAAAGC 1 TGGCCAGACT TCATTATTCC GGATCAATCA 642 0 

ATGCCTGGAG CACCAAGGAG CCCTTTTCTT: GGATCAAGGT GGATCTGTTG GCACCAATGA 64 80 

TTATTCACGG CATCAAGACC CAGGGTGCCC GTCAGAAGTT CTCCAGCCTC TACATCTCTC 6 54 0 

AGTTTATCAT CATGTATAGT CTTGATGGGA AGAAGTGGCA GACTTATCGA GGAAATTCCA 6600 

CTGGAACCTT AATGGTCTTC TTTGGCAATG TGGATTCATC TGGGATAAAA CACAATATTT 6660 

TTAACCCTCC AATTATTGCT CGATACATCC GTTTGCACCC AACTCATTAT AGCATTCGCA 672 0 

GCACTCTTCG CATGGAGTTG ATGGGCTGTG ATTTAAATAG TTGCAGCATG CCATTGGGAA 67 8 0 

TGGAGAGTAA AGCAATATCA GATGCACACjA TTACTGCTTC ATCCTACTTT ACCAATATGT 6 84 0 

TTGCCACCTG GTCTCCTTCA AAAGCTCGAC TTCACCTCCA AGGGAGGAGT AATGCCTGGA 6 9 00 

GACCTCAGGT GAATAATCCA AAAGAGTGGC TGCAAGTGGA CTTCCAGAAG ACAATGAAAG 69 6 0 

TCACAGGAGT AACTACTCAG GGAGTAAAAT CTCTGCTTAC CAGCATGTAT GTGAAGGAGT 7 02 0 

TCCTCATCTC CAGCAGTCAA GATGGCCATC AGTGGACTCT CTTTTTTCAG AATGGCAAAG 7 080 

TAAAGGTTTT TCAGGGAAAT CAAGACT£CT TCACACCTGT GGTGAACTCT CTAGACCCAC 7140 

CGTTACTGAC TCGCTACCTT CGAATTC^CC CCCAGAGTTG GGTGCACCAG ATTGCCCTGA 72 00 

GGATGGAGGT TCTGGGCTGC GAGGCAC^GG ACCTCTACTG AGGGTGGCCA CTGCAGCACC 72 6 0 

TGCCACTGCC GTCACCTCTC CCTCCTCjAGC TCCAGGGCAG TGTCCCTCCC TGGCTTGCCT 73 2 0 

TCTACCTTTG TGCTAAATCC TAGCAGACAC TGCCTTGAAG CCTCCTGAAT TAACTATCAT 73 8 0 

GAGTCCTGCA TTTCTTTGGT GGGGGGCCAG GAGGGTGCAT CCAATTTAAC TTAACTCTTA 74 4 0 

CCTATTTTCT GCAGCTGCTC CCAGATTACT CCTTCCTTCC AATATAACTA GGCAAAAAGA 7 500 
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AGTGAGGAGA AACCTGCATG AAAGCATTCT TCCCTGAAAA GTTAGGCCTC TCAGAGTCAC 7 560 

CACTTCCTCT GTTGTAGAAA AACTATGTGA TGAAACTTTG AAAAAGATAT TTATGATGTT 7 62 0 

AACATTTCAG GTTAAGCCTC ATACGTTTAA AATAAAACTC TCAGTTGTTT ATTATCCTGA 7 6 80 

TCAAGCATGG AACAAAGCAT GTTTCAGGAT CAGATCAATA CAATCTTGGA GTCAAAAGGC 7740 

AAATCATTTG GACAATCTGC AAAATGGAGA GAATACAATA ACTACTACAG TAAAGTCTGT 7 800 

TTCTGCTTCC TTACACATAG ATATAATTAT GTTATTTAGT CATTATGAGG GGCAGATTCT 7 860 

TATCTCCAAA ACTAGCATTC TTAAACTGAG AATTATAGAT GGGGTTCAAG AATCCCTAAG 792 0 

TCCCCTGAAA TTATATAAGG CATTCTGTAT AAATGCAAAT GTGCATTTTT CTGACGAGTG 79 80 

TCCATAGATA TAAAGCCATT TGGTCTTAAT TCTGACCAAT AAAAAAATAA GTCAGGAGGA 8 04 0 

TGCAATTGTT GAAAGCTTTG AAATAAAATA ACAATGTCTT CTTGAAATTT GTGATGGCCA 810 0 

AGAAAGAAAA TGATGATGAC ATTAGGCTTC TAAAGGACAT ACATTTAATA TTTCTGTGGA 816 0 

AATATGAGGA AAATCCATGG TTATCTGAGA TAGGAGATAC AAACTTTGTA ATTCTAATAA 82 2 0 

TGCACTCAGT TTACTCTCTC CCTCTACTAA TTTCCTGCTG AAAATAACAC AACAAAAATG 82 8 0 

TAACAGGGGA AATTATATAC CGTGACTGAA AACTAGAGTC CTACTTACAT AGTTGAAATA 83 40 

TCAAGGAGGT CAGAAGAAAA TTGGACTGGT GAAAACAGAA AAAACACTCC AGTCTGCCAT 8 40 0 

ATCACCACAC AATAGGATCC CCCTTCTTGC CCTCCACCCC CATAAGATTG TGAAGGGTTT 8 46 0 

ACTGCTCCTT CCATCTGCCT GACCCCTTCA CTATGACTAC AC AG AATC TC CTGATAGTAA 8 52 0 

AGGGGGCTGG AGGCAAGGAT AAGTTATAGA GCAGTTGGAG GAAGCATCCA AAGATTGCAA 8 58 0 

CCCAGGGCAA ATGGAAAACA GGAGATCCTA ATATGAAAGA AAAATGGATC CCAATCTGAG 8 64 0 

AAAAGGCAAA AGAATGGCTA CTTTTTTCTA TGCTGGAGTA TTTTCTAATA ATCCTGCTTG 87 0 0 

ACCCTTATCT GACCTCTTTG GAAACTATAA CATAGQTGTC ACAGTATAGT CACAATCCAC 87 60 

AAATGATGCA GGTGCAAATG GTTTATAGCC CTGTGAAGTT CTTAAAGTTT AGAGGCTAAC 8 82 0 

TTACAGAAAT GAATAAGTTG TTTTGTTTTA TAGCCCGGTA GAGGAGTTAA CCCCAAAGGT 8880 

GATATGGTTT TATTTCCTGT TATGTTTAAC TTAATAATCT TATTTTGGCA TTCTTTTCCC 8940 

ATTGACTATA TACATCTCTA TTTCTCAAAT GTTCATGGAA CTAGCTCTTT TATTTTCCTG 9 0 00 

CTGGTTTCTT CAGTAATGAG TTAAATAAAA CATTGACACA TACAAAAAAA AAAAAAAAAA 90 6 0 

AAAAAAAAAA AAAAAAAAAA 9080 

(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Gin lie Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe 
1 5 10 15 

Cys Phe Ser Ala Thr Arg Arg/ Tyr Tyr Leu Gly Ala Val Glu Leu Ser 

20 ' 25 30 

Trp Asp Tyr Met Gin Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg 
35 40 45 

Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val 
50 55 60 

Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn lie 
65 70 75 80 

Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr lie Gin 

85 90 95 

Ala Glu Val Tyr Asp Thr Val Val lie Thr Leu Lys Asn Met Ala Ser 
100 105 110 



His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser 
115 120 125 
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Glu Gly Ala Glu 

130 

Asp Lys Val Phe 
145 

Lys Glu Asn Gly 



Tyr Leu Ser His 
180 

Gly Ala Leu Leu 
195 

Gin Thr Leu His 
210 

Lys Ser Trp His 
225 

Ala Ala Ser Ala 



Val Asn Arg Ser 
260 

Tyr Trp His Val 
275 

Phe Leu Glu Gly 
290 

Leu Glu lie Ser 

305 

Asp Leu Gly Gin 



Asp Gly Met Glu 
340 

Gin Leu Arg Met 
355 

Leu Thr Asp Ser 
370 

Pro Ser Phe lie 
385 

Trp Val His Tyr 



Leu Val Leu Ala 
420 

Asn Gly Pro Gin 
435 



Tyr Asp Asp Gin 

135 

Pro Gly Gly Ser 
150 

Pro Met Ala Ser 
165 

Val Asp Leu Val 



Val Cys Arg Glu 
200 

Lys Phe lie Leu 
215 

Ser Glu Thr Lys 
230 

Arg Ala Trp Pro 
245 

Leu Pro Gly Leu 



He Gly Met Gly 
280 

His Thr Phe Lqu 
295 

Pro He Thr Phe 
310 

Phe Leu Leu f>he 
325 

Ala Tyr Val Lys 



Lys Asn Asn Glu 
360 

Glu Met Asp Val 
37 5 

Gin He Arg Ser 
390 

He Ala Ala Glu 
405 

Pro Asp Asp Arg 



Arg He Gly Arg 
440 



Thr Ser Gin Arg 

140 

His Thr Tyr Val 
155 

Aap Pro Leu Cys 
170 

Lys Asp Leu Asn 
185 

Gly Ser Leu Ala 



Leu Phe Ala Val 
220 

Asn Ser Leu Met 
235 

Lys Met His Thr 
250 

He Gly Cys His 
265 

Thr Thr Pro Glu 



Val Arg Asn His 

300 

Leu Thr Ala Gin 
315 

Cys His He Ser 
330 

Val Asp Ser Cys 
345 

Glu Ala Glu Asp 



Val Arg Phe Asp 
380 

Val Ala Lys Lys 
395 

Glu Glu Asp Trp 
410 

Ser Tyr Lys Ser 
425 

Lys Tyr Lys Lys 



Glu Lys Glu Asp 



Trp Gin Val Leu 
160 

Leu Thr Tyr Ser 
175 

Ser Gly Leu He 
190 

Lys Glu Lys Thr 

205 

Phe Asp Glu Gly 



Gin Asp Arg Asp 
240 

Val Asn Gly Tyr 
255 

Arg Lys Ser Val 

270 

Val His Ser He 
285 

Arg Gin Ala Ser 



Thr Leu Leu Met 
320 

Ser His Gin His 
335 

Pro Glu Glu Pro 
350 

Tyr Asp Asp Asp 
365 

Asp Asp Asn Ser 



His Pro Lys Thr 
400 

Asp Tyr Ala Pro 
415 

Gin Tyr Leu Asn 
430 

Val Arg Phe Met 
445 



Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala He Gin His Glu 
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450 455 460 

Ser Gly lie Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu 
465 470 475 480 

Leu He He Phe Lys Asn Gin Ala Ser Arg Pro Tyr Asn He Tyr Pro 

485 490 495 

His Gly He Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys 
500 505 510 

Gly Val Lys His Leu Lys Asp Phe Pro He Leu Pro Gly Glu He Phe 
515 520 525 

Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp 
530 535 540 

Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg 
545 550 555 560 

Asp Leu Ala Ser Gly Leu He Gly Pro Leu Leu He Cys Tyr Lys Glu 

565 570 575 

Ser Val Asp Gin Arg Gly Asn Gin He Met Ser Asp Lys Arg Asn Val 
580 585 590 

He Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu 
595 600 605 

Asn He Gin Arg Phe Leu Pro Asn Pro Ala Gly Val Gin Leu Glu Asp 
610 615 620 

Pro Glu Phe Gin Ala Ser Asn lie Met His Ser He Asn Gly Tyr Val 
625 630 635 640 

Phe Asp Ser Leu Gin Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp 

645 650 655 

Tyr He Leu Ser He Gly Ala Gin Thr Asp Phe Leu Ser Val Phe Phe 
660 665 670 

Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr 
675 680 685 

Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro 
690 695 700 

Gly Leu Trp He Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly 
705 710 715 720 

Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp 

725 730 735 

Tyr Tyr Glu Asp Ser Tyr Glu Asp He Ser Ala Tyr Leu Leu Ser Lys 
740 745 750 

Asn Asn Ala He Glu Pro Arg Ser Phe Ser Gin Asn Ser Arg His Pro 
755 760 765 



Ser Thr Arg Gin Lys Gin Phe Asn Ala Thr Thr He Pro Glu Asn Asp 
770 775 780 
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He Glu Lys Thr Asp Pro Trp Phe Ala His Arg Thr Pro Met Pro Lys 
785 790 795 800 

He Gin Asn Val Ser Ser Ser Asp Leu Leu Met Leu Leu Arg Gin Ser 

805 810 815 

Pro Thr Pro His Gly Leu Ser Leu Ser Asp Leu Gin Glu Ala Lys Tyr 
820 825 830 

Glu Thr Phe Ser Asp Asp Pro Ser Pro Gly Ala He Asp Ser Asn Asn 
835 840 845 

Ser Leu Ser Glu Met Thr His Phe Arg Pro Gin Leu His His Ser Gly 
850 855 860 

Asp Met Val Phe Thr Pro Glu Ser Gly Leu Gin Leu Arg Leu Asn Glu 
865 870 875 880 

Lys Leu Gly Thr Thr Ala Ala Thr Glu Leu Lys Lys Leu Asp Phe Lys 

885 890 895 

Val Ser Ser Thr Ser Asn Asn Leu He Ser Thr He Pro Ser Asp Asn 
900 905 910 

Leu Ala Ala Gly Thr Asp Asn Thr Ser Ser Leu Gly Pro Pro Ser Met 
915 920 925 

Pro Val His Tyr Asp Ser Gin Leu Asp Thr Thr Leu Phe Gly Lys Lys 
930 935 940 

Ser Ser Pro Leu Thr Glu Ser Gly Gly Pro Leu Ser Leu Ser Glu Glu 
945 950 955 960 

Asn Asn Asp Ser Lys Leu Leu Glu Ser Gly Leu Met Asn Ser Gin Glu 

965 970 975 

Ser Ser Trp Gly Lys Asn Val Ser Ser Thr Glu Ser Gly Arg Leu Phe 
980 985 990 

Lys Gly Lys Arg Ala His Gly Pro Ala Leu Leu Thr Lys Asp Asn Ala 
995 1000 1005 

Leu Phe Lys Val Ser He Ser Leu Leu Lys Thr Asn Lys Thr Ser Asn 
1010 1015 1020 

Asn Ser Ala Thr Asn Arg Lys Thr His He Asp Gly Pro Ser Leu Leu 
1025 1030 1035 1040 

He Glu Asn Ser Pro Ser Val Trp Gin Asn He Leu Glu Ser Asp Thr 

1045 1050 1055 

Glu Phe Lys Lys Val Thr Pro Leu He His Asp Arg Met Leu Met Asp 
1060 1065 1070 

Lys Asn Ala Thr Ala Leu Arg Leu Asn His Met Ser Asn Lys Thr Thr 
1075 1080 1085 



Ser Ser Lys Asn Met Glu Met Val Gin Gin Lys Lys Glu Gly Pro He 
1090 1095 1100 
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Pro Pro Asp Ala Gin Asn Pro Asp Met Ser Phe Phe Lys Met Leu Phe 
1105 1110 1115 1120 

Leu Pro Glu Ser Ala Arg Trp lie Gin Arg Thr His Gly Lys Asn Ser 

1125 1130 1135 

Leu Asn Ser Gly Gin Gly Pro Ser Pro Lys Gin Leu Val Ser Leu Gly 
1140 1145 1150 

Pro Glu Lys Ser Val Glu Gly Gin Asn Phe Leu Ser Glu Lys Asn Lys 
1155 1160 1165 

Val Val Val Gly Lys Gly Glu Phe Thr Lys Asp Val Gly Leu Lys Glu 
1170 1175 1180 

Met Val Phe Pro Ser Ser Arg Asn Leu Phe Leu Thr Asn Leu Asp Asn 
1185 1190 1195 1200 

Leu His Glu Asn Asn Thr His Asn Gin Glu Lys Lys lie Gin Glu Glu 

1205 1210 1215 

He Glu Lys Lys Glu Thr Leu He Gin Glu Asn Val Val Leu Pro Gin 
1220 1225 1230 

He His Thr Val Thr Gly Thr Lys Asn Phe Met Lys Asn Leu Phe Leu 
1235 1240 1245 

Leu Ser Thr Arg Gin Asn Val Glu Gly Ser Tyr Asp Gly Ala Tyr Ala 
1250 1255 1260 

Pro Val Leu Gin Asp Phe Arg Ser Leu Asn Asp Ser Thr Asn Arg Thr 
1265 1270 1275 1280 

Lys Lys His Thr Ala His Phe Ser Lys Lys Gly Glu Glu Glu Asn Leu 

1285 1290 1295 

Glu Gly Leu Gly Asn Gin Thr Lys Gin He Val Glu Lys Tyr Ala Cys 
1300 1305 1310 

Thr Thr Arg He Ser Pro Asn Thr Ser Gin Gin Asn Phe Val Thr Gin 
1315 1320 1325 

Arg Ser Lys Arg Ala Leu Lys Gin Phe Arg Leu Pro Leu Glu Glu Thr 
1330 1335 1340 

Glu Leu Glu Lys Arg He He Val Asp Asp Thr Ser Thr Gin Trp Ser 
1345 1350 1355 1360 

Lys Asn Met Lys His Leu Thr Pro Ser Thr Leu Thr Gin He Asp Tyr 

1365 1370 1375 

Asn Glu Lys Glu Lys Gly Ala He Thr Gin Ser Pro Leu Ser Asp Cys 
1380 1385 1390 



Leu Thr Arg Ser His Ser He 
1395 



tfro Gin Ala Asn Arg Ser Pro Leu Pro 
400 1405 



He Ala Lys Val Ser Ser Phe Pro Ser He Arg Pro He Tyr Leu Thr 
1410 1415 1420 



Arg Val Leu Phe Gin Asp Asn Ser Ser His Leu Pro Ala Ala Ser Tyr 
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1425 1430 1435 1440 

Arg Lys Lys Asp Ser Gly Val Gin Glu Ser Ser His Phe Leu Gin Gly 

1445 1450 1455 

Ala Lys Lys Asn Asn Leu Ser Leu Ala lie Leu Thr Leu Glu Met Thr 
1460 1465 1470 

Gly Asp Gin Arg Glu Val Gly Ser Leu Gly Thr Ser Ala Thr Asn Ser 
1475 1480 1485 

Val Thr Tyr Lys Lys Val Glu Asn Thr Val Leu Pro Lys Pro Asp Leu 
1490 1495 1500 

Pro Lys Thr Ser Gly Lys Val Glu Leu Leu Pro Lys Val His lie Tyr 
1505 1510 1515 1520 

Gin Lys Asp Leu Phe Pro Thr Glu Thr Ser Asn Gly Ser Pro Gly His 

1525 1530 1535 

Leu Asp Leu Val Glu Gly Ser Leu Leu Gin Gly Thr Glu Gly Ala lie 
1540 1545 1550 

Lys Trp Asn Glu Ala Asn Arg Pro Gly Lys Val Pro Phe Leu Arg Val 
1555 1560 1565 

Ala Thr Glu Ser Ser Ala Lys Thr Pro Ser Lys Leu Leu Asp Pro Leu 
1570 1575 1580 

Ala Trp Asp Asn His Tyr Gly Thr Gin lie Pro Lys Glu Glu Trp Lys 
1585 1590 1595 1600 

Ser Gin Glu Lys Ser Pro Glu Lys Thr Ala Phe Lys Lys Lys Asp Thr 

1605 1610 1615 

lie Leu Ser Leu Asn Ala Cys Glu Ser Asn His Ala lie Ala Ala lie 
1620 1625 1630 

Asn Glu Gly Gin Asn Lys Pro Glu lie Glu Val Thr Trp Ala Lys Gin 
1635 1640 1645 

Gly Arg Thr Glu Arg Leu Cys Ser Gin Asn Pro Pro Val Leu Lys Arg 
1650 1655 1660 

His Gin Arg Glu lie Thr Arg Thr Thr Leu Gin Ser Asp Gin Glu Glu 
1665 1670 1675 1680 

lie Asp Tyr Asp Asp Thr lie Ser Val Glu Met Lys Lys Glu Asp Phe 

1685 1690 1695 

Asp lie Tyr Asp Glu Asp Glu Asn Gin Ser Pro Arg Ser Phe Gin Lys 
1700 1705 1710 

Lys Thr Arg His Tyr Phe lie Ala Ala Val Glu Arg Leu Trp Asp Tyr 
1715 1720 1725 

Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gin Ser Gly 
1730 1735 1740 



Ser Val Pro Gin Phe Lys Lys Val Val Phe Gin Glu Phe Thr Asp Gly 
1745 1750 1755 1760 
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Ser Phe Thr Gin Pro Leu Tyr Arg Gly Glu Leu Asn Glu His Leu Gly 

1765 1770 1775 

Leu Leu Gly Pro Tyr lie Arg Ala Glu Val Glu Asp Asn lie Met Val 
1780 1785 1790 

Thr Phe Arg Asn Gin Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu 
1795 1800 1805 

lie Ser Tyr Glu Glu Asp Gin Arg Gin Gly Ala Glu Pro Arg Lys Asn 
1810 1815 1820 

Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val Gin His 
1825 1830 1835 1840 

His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp Ala Tyr 

1845 1850 1855 

Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu lie Gly 
1860 1865 1870 

Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His Gly Arg 
1875 1880 1885 

Gin Val Thr Val Gin Glu Phe Ala Leu Phe Phe Thr lie Phe Asp Glu 
1890 1895 1900 

Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala 
1905 1910 1915 1920 

Pro Cys Asn lie Gin Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg 

1925 1930 1935 

Phe His Ala lie Asn Gly Tyr lie Met Asp Thr Leu Pro Gly Leu Val 
1940 1945 1950 

Met Ala Gin Asp Gin Arg lie Arg Trp Tyr Leu Leu Ser Met Gly Ser 
1955 1960 1965 

Asn Glu Asn lie His Ser lie His Phe Ser Gly His Val Phe Thr Val 
1970 1975 1980 

Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr Pro Gly 
1985 1990 1995 2000 

Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly lie Trp Arg 

2005 2010 2015 

Val Glu Cys Leu lie Gly Glu His Leu His Ala Gly Met Ser Thr Leu 
2020 2025 2030 

Phe Leu Val Tyr Ser Asn Lys Cys Gin Thr Pro Leu Gly Met Ala Ser 
2035 2040 2045 

Gly His lie Arg Asp Phe Gin lie Thr Ala Ser Gly Gin Tyr Gly Gin 
2050 2055 2060 



Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser lie Asn Ala 
2065 2070 2075 2080 
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Trp Ser Thr Lys Glu Pro Phe Ser Trp lie Lys Val Asp Leu Leu Ala 

2085 2090 2095 

Pro Met He He His Gly He Lys Thr Gin Gly Ala Arg Gin Lys Phe 
2100 2105 2110 

Ser Ser Leu Tyr He Ser Gin Phe He He Met Tyr Ser Leu Asp Gly 
2115 2120 2125 

Lys Lys Trp Gin Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val 
2130 2135 2140 

Phe Phe Gly Asn Val Asp Ser Ser Gly He Lys His Asn He Phe Asn 
2145 2150 2155 2160 

Pro Pro He He Ala Arg Tyr He Arg Leu His Pro Thr His Tyr Ser 

2165 2170 2175 

He Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser 
2180 2185 2190 

Cys Ser Met Pro Leu Gly Met Glu Ser Lys Ala He Ser Asp Ala Gin 
2195 2200 2205 

He Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro 
2210 2215 2220 

Ser Lys Ala Arg Leu His Leu Gin Gly Arg Ser Asn Ala Trp Arg Pro 
2225 2230 2235 2240 

Gin Val Asn Asn Pro Lys Glu Trp Leu Gin Val Asp Phe Gin Lys Thr 

2245 2250 2255 

Met Lys Val Thr Gly Val Thr Thr Gin Gly Val Lys Ser Leu Leu Thr 
2260 2265 2270 

Ser Met Tyr Val Lys Glu Phe Leu He Ser Ser Ser Gin Asp Gly His 
2275 2280 2285 

Gin Trp Thr Leu Phe Phe Gin Asn Gly Lys Val Lys Val Phe Gin Gly 
2290 2295 2300 

Asn Gin Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro Pro Leu 
2305 2310 2315 2320 

Leu Thr Arg Tyr Leu Arg He His Pro Gin Ser Trp Val His Gin He 

2325 2330 2335 

Ala Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gin Asp Leu Tyr 
2340 2345 2350 



(2) INFORMATION FOR SEQ ID NO : 4 6 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4832 base pairs 
{ B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
CTCGAGCTAA AGATATTTTA GAGAAGAATT AACCTTTTGC TTCTCCAGTT GAACATTTGT 60 
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AGCAATAAGT CATGCAAATA GAGCTCTCCA CCTGCTTCTT TCTGTGCCTT TTGCGATTCT 12 0 

GCTTTAGTGC CACCAGAAGA TACTACCTGG GTGCAGTGGA ACTGTCATGG GACTATATGC 180 

AAAGTGATCT CGGTGAGCTG CCTGTGGACG CAAGATTTCC TCCTAGAGTG CCAAAATCTT 2 40 

TTCCATTCAA CACCTCAGTC GTGTACAAAA AGACTCTGTT TGTAGAATTC ACGGATCACC 3 00 

TTTTCAACAT CGCTAAGCCA AGGCCACCCT GGATGGGTCT GCTAGGTCCT ACCATCCAGG 3 60 

CTGAGGTTTA TGATACAGTG GTCATTACAC TTAAGAACAT GGCTTCCCAT CCTGTCAGTC 42 0 

TTCATGCTGT TGGTGTATCC TACTGGAAAG CTTCTGAGGG AGCTGAATAT GATGATCAGA 4 80 

CCAGTCAAAG GGAGAAAGAA GATGATAAAG TCTTCCCTGG TGGAAGCCAT ACATATGTCT 540 

GGCAGGTCCT GAAAGAGAAT GGTCCAATGG CCTCTGACCC ACTGTGCCTT ACCTACTCAT 6 00 

ATCTTTCTCA TGTGGACCTG GTAAAAGACT TGAATTCAGG CCTCATTGGA GCCCTACTAG 660 

TATGTAGAGA AGGGAGTCTG GCCAAGGAAA AGACACAGAC CTTGCACAAA TTTATACTAC 720 

TTTTTGCTGT ATTTGATGAA GGGAAAAGTT GGCACTCAGA AACAAAGAAC TCCTTGATGC 7 80 

AGGATAGGGA TGCTGCATCT GCTCGGGCCT GGCCTAAAAT GCACACAGTC AATGGTTATG 84 0 

TAAACAGGTC TCTGCCAGGT CTGATTGGAT GCCACAGGAA ATCAGTCTAT TGGCATGTGA 90 0 

TTGGAATGGG CACCACTCCT GAAGTGCACT CAATATTCCT CGAAGGTCAC ACATTTCTTG 9 60 

TGAGGAACCA TCGCCAGGCG TCCTTGGAAA TCTCGCCAAT AACTTTCCTT ACTGCTCAAA 102 0 

CACTCTTGAT GGACCTTGGA CAGTTTCTAC TGTTTTGTCA TATCTCTTCC CACCAACATG 10 8 0 

ATGGCATGGA AGCTTATGTC AAAGTAGACA GCT^TCCAGA GGAACCCCAA CTACGAATGA 1140 

AAAATAATGA AGAAGCGGAA GACTATGATG ATGATCTTAC TGATTCTGAA ATGGATGTGG 12 00 

TCAGGTTTGA TGATGACAAC TCTCCTTCCT TTATCCAAAT TCGCTCAGTT GCCAAGAAGC 12 60 

ATCCTAAAAC TTGGGTACAT TACATTGCTG CTGAAGAGGA GGACTGGGAC TATGCTCCCT 13 2 0 

TAGTCCTCGC CCCCGATGAC AGAAGTTATA AAAGTCAATA TTTGAACAAT GGCCCTCAGC 13 8 0 

GGATTGGTAG GAAGTACAAA AAAGTCCGAT TTATGGCATA CACAGATGAA ACCTTTAAGA 1440 

CTCGTGAAGC TATTCAGCAT GAATCAGGAA TCtTGGGACC TTTACTTTAT GGGGAAGTTG 150 0 

GAGACACACT GTTGATTATA TTTAAGAATC AAGCAAGCAG ACCATATAAC ATCTACCCTC 156 0 

ACGGAATCAC TGATGTCCGT CCTTTGTATT CAAGGAGATT ACCAAAAGGT GTAAAACATT 162 0 

TGAAGGATTT TCCAATTCTG CCAGGAGAAA TATTCAAATA TAAATGGACA GTGACTGTAG 168 0 

AAGATGGGCC AACTAAATCA GATCCTCGGT GGCTGACCCG CTATTACTCT AGTTTCGTTA 174 0 

ATATGGAGAG AGATCTAGCT TCAGGACTCA TtGGCCCTCT CCTCATCTGC TACAAAGAAT 1800 

CTGTAGATCA AAGAGGAAAC CAGATAATGT CAGACAAGAG GAATGTCATC CTGTTTTCTG 186 0 

TATTTGATGA GAACCGAAGC TGGTACCTCA C^GAGAATAT ACAACGCTTT CTCCCCAATC 192 0 

CAGCTGGAGT GCAGCTTGAG GATCCAGAGT TtCAAGCCTC CAACATCATG CACAGCATCA 19 8 0 

ATGGCTATGT TTTTGATAGT TTGCAGTTGT C(AGTTTGTTT GCATGAGGTG GCATACTGGT 204 0 

ACATTCTAAG CATTGGAGCA CAGACTGACT tCCTTTCTGT CTTCTTCTCT GGATATACCT 210 0 

TCAAACACAA AATGGTCTAT GAAGACACAC tCACCCTATT CCCATTCTCA GGAGAAACTG 2160 

TCTTCATGTC GATGGAAAAC CCAGGTCTAT GGATTCTGGG GTGCCACAAC TCAGACTTTC 222 0 

GGAACAGAGG CATGACCGCC TTACTGAAGG fTTTCTAGTTG TGACAAGAAC ACTGGTGATT 22 8 0 

ATTACGAGGA CAGTTATGAA GATATTTCAG /CATACTTGCT GAGTAAAAAC AATGCCATTG 2 3 40 

AACCAAGAAG CTTCTCCCAG AACCCACCAG ; TCTTGAAACG CCATCAACGG GAAATAACTC 2 40 0 

GTACTACTCT TCAGTCAGAT CAAGAGGAAA ' TTGACTATGA TGATACCATA TCAGTTGAAA 2 46 0 

TGAAGAAGGA AGATTTTGAC ATTTATGATG, AGGATGAAAA TCAGAGCCCC CGCAGCTTTC 2 52 0 

AAAAGAAAAC ACGACACTAT TTTATTGCTG CAGTGGAGAG GCTCTGGGAT TATGGGATGA 2 580 

GTAGCTCCCC AC ATGTTC T A AGAAACAGGG CTCAGAGTGG CAGTGTCCCT CAGTTCAAGA 2 64 0 

AAGTTGTTTT CCAGGAATTT ACTGATGGCt CCTTTACTCA GCCCTTATAC CGTGGAGAAC 2 7 00 

TAAATGAACA TTTGGGACTC CTGGGGCCAT ATATAAGAGC AGAAGTTGAA GATAATATCA 2 7 60 

TGGTAACTTT CAGAAATCAG GCCTCTCGTb CCTATTCCTT CTATTCTAGC CTTATTTCTT 2 82 0 

ATGAGGAAGA TCAGAGGCAA GGAGCAGAAC CTAGAAAAAA CTTTGTCAAG CCTAATGAAA 2 880 

CCAAAACTTA CTTTTGGAAA GTGCAACATC ATATGGCACC CACTAAAGAT GAGTTTGACT 2 94 0 

GCAAAGCCTG GGCTTATTTC TCTGATGTTG ACCTGGAAAA AGATGTGCAC TCAGGCCTGA 3 000 

TTGGACCCCT TCTGGTCTGC CACACTA^CA CACTGAACCC TGCTCATGGG AGACAAGTGA 3 06 0 

CAGTACAGGA ATTTGCTCTG TTTTTCACCA TCTTTGATGA GACCAAAAGC TGGTACTTCA 312 0 

CTGAAAATAT GGAAAGAAAC TGCAGGGfTC CCTGCAATAT CCAGATGGAA GATCCCACTT 3180 

TTAAAGAGAA TTATCGCTTC CATGCAA^TCA ATGGCTACAT AATGGATACA CTACCTGGCT 3 240 

TAGTAATGGC TCAGGATCAA AGGATTCGAT GGTATCTGCT CAGCATGGGC AGCAATGAAA 3 3 00 

ACATCCATTC TATTCATTTC AGTGGACATG TGTTCACTGT ACGAAAAAAA GAGGAGTATA 3 3 60 

AAATGGCACT GTACAATCTC TATCCAGGTG TTTTTGAGAC AGTGGAAATG TTACCATCCA 3 42 0 

AAGCTGGAAT TTGGCGGGTG GAATGCCTTA TTGGCGAGCA TCTACATGCT GGGATGAGCA 3 48 0 

CACTTTTTCT GGTGTACAGC AATAAGTGTC AGACTCCCCT GGGAATGGCT TCTGGACACA 3 54 0 

TTAGAGATTT TCAGATTACA GCTTCAGGAC AATATGGACA GTGGGCCCCA AAGCTGGCCA 3 60 0 

GACTTCATTA TTCCGGATCA ATCAATGCCT GGAGCACCAA GGAGCCCTTT TCTTGGATCA 3 66 0 

AGGTGGATCT GTTGGCACCA ATGATTATTC ACGGCATCAA GACCCAGGGT GCCCGTCAGA 3 72 0 
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AGTTCTCCAG CCTCTACATC TCTCAGTTTA TCATCATGTA TAGTCTTGAT GGGAAGAAGT 37 80 

GGCAGACTTA TCGAGGAAAT TCCACTGGAA CCTTAATGGT CTTCTTTGGC AATGTGGATT 3 84 0 

CATCTGGGAT AAAACACAAT ATTTTTAACC CTCCAATTAT TGCTCGATAC ATCCGTTTGC 3 9 00 

ACCCAACTCA TTATAGCATT CGCAGCACTC TTCGCATGGA GTTGATGGGC TGTGATTTAA 3 9 60 

ATAGTTGCAG CATGCCATTG GGAATGGAGA GTAAAGCAAT ATCAGATGCA CAGATTACTG 4 02 0 

CTTCATCCTA CTTTACCAAT ATGTTTGCCA CCTGGTCTCC TTCAAAAGCT CGACTTCACC 40 80 

TCCAAGGGAG GAGTAATGCC TGGAGACCTC AGGTGAATAA TCCAAAAGAG TGGCTGCAAG 414 0 

TGGACTTCCA GAAGACAATG AAAGTCACAG GAGTAACTAC TCAGGGAGTA AAATCTCTGC 42 00 

TTACCAGCAT GTATGTGAAG GAGTTCCTCA TCTCCAGCAG TCAAGATGGC CATCAGTGGA 42 60 

CTCTCTTTTT TCAGAATGGC AAAGTAAAGG TTTTTCAGGG AAATCAAGAC TCCTTCACAC 43 2 0 

CTGTGGTGAA CTCTCTAGAC CCACCGTTAC TGACTCGCTA CCTTCGAATT CACCCCCAGA 43 8 0 

GTTGGGTGCA CCAGATTGCC CTGAGGATGG AGGTTCTGGG CTGCGAGGCA CAGGACCTCT 444 0 

ACTGAGGGTG GCCACTGCAG CACCTGCCAC TGCCGTCACC TCTCCCTCCT CAGCTCCAGG 4 5 00 

GCAGTGTCCC TCCCTGGCTT GCCTTCTACC TTTGTGCTAA ATCCTAGCAG ACACTGCCTT 4 5 60 

GAAGCCTCCT GAATTAACTA TCATCAGTCC TGCATTTCTT TGGTGGGGGG CCAGGAGGGT 4 62 0 

GCATCCAATT TAACTTAACT CTTACCTATT TTCTGCAGCT GCTCCCAGAT TACTCCTTCC 46 80 

TTCCAATATA ACTAGGCAAA AAGAAGTGAG GAGAAACCTG CATGAAAGCA TTCTTCCCTG 47 40 

AAAAGTTAGG CCTCTCAGAG TCACCACTTC CTCTGTTGTA GAAAAACTAT GTGATGAAAC 4 8 00 

TTTGAAAAAG ATATTTATGA TGTTGCGGCC GC 483 2 

(2) INFORMATION FOR SEQ ID NO: 47: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1457 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Met Gin lie Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe 
1 5 10 15 

Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser 

2 0 ,25 3 0 

Trp Asp Tyr Met Gin Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg 
35 40 : 45 

Phe Pro Pro Arg Val Pro Lys Sefr Phe Pro Phe Asn Thr Ser Val Val 

50 55 ' 60 

Tyr Lys Lys Thr Leu Phe Val G]|u Phe Thr Asp His Leu Phe Asn lie 
65 70 75 80 

Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr lie Gin 

85 90 95 

Ala Glu Val Tyr Asp Thr Val Val lie Thr Leu Lys Asn Met Ala Ser 
100 105 110 

His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser 
115 120 125 

Glu Gly Ala Glu Tyr Asp Asp Gin Thr Ser Gin Arg Glu Lys Glu Asp 
130 135 140 

Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gin Val Leu 
145 150 155 160 



Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser 

165 170 175 
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Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu lie 
180 185 190 

Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr 
195 200 205 

Gin Thr Leu His Lys Phe lie Leu Leu Phe Ala Val Phe Asp Glu Gly 
210 215 220 

Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gin Asp Arg Asp 
225 230 235 240 

Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr 

245 250 255 



Val Asn Arg Ser 
260 



Leu Pro Gly Leu lie Gly Cys His Arg Lys Ser Val 

265 270 



Tyr Trp His Val 
275 



lie Gly Met Gly Thr Thr Pro Glu Val His Ser lie 
280 285 



Phe Leu Glu Gly 
290 



His Thr Phe Leu Val Arg Asn His Arg Gin Ala Ser 
295 300 



Leu Glu He Ser 
305 



Pro He Thr Phe Leu Thr Ala Gin Thr Leu Leu Met 
310 315 320 



Asp Leu Gly Gin Phe Leu Leu Phe Cys H|s He Ser Ser His Gin His 

325 330 335 



Asp Gly Met Glu 
340 



Ala Tyr Val Lys Val A$p Ser Cys Pro Glu Glu Pro 

345 350 



Gin Leu Arg Met 
355 



Lys Asn Asn Glu Glu Aja Glu Asp Tyr Asp Asp Asp 
360 365 



Leu Thr Asp Ser 
370 



Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser 
375 380 



Pro Ser Phe He 
385 



Gin He Arg Ser Val Ala Lys Lys His Pro Lys Thr 
390 395 400 



Trp Val His Tyr He Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro 

405 410 415 



Leu Val Leu Ala 
420 



Pro Asp Asp Arg Ser Tyr Lys Ser Gin Tyr Leu Asn 

425 430 



Asn Gly Pro Gin 
435 



Arg He Gly Arg Lys Tyr Lys Lys Val Arg Phe Met 
440 445 



Ala Tyr Thr Asp 
450 



Glu Thr Phe Lys Thr Axg Glu Ala He Gin His Glu 
455 460 



Ser Gly He Leu 
465 



Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu 
470 475 480 



Leu He He Phe Lys Asn Gin Ala Ser Arg Pro Tyr Asn He Tyr Pro 

485 490 495 
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His Gly lie Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys 

500 505 510 

Gly Val Lys His Leu Lys Asp Phe Pro lie Leu Pro Gly Glu lie Phe 
515 520 525 

Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp 
530 535 540 

Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg 
545 550 555 560 

Asp Leu Ala Ser Gly Leu lie Gly Pro Leu Leu lie Cys Tyr Lys Glu 

565 570 575 

Ser Val Asp Gin Arg Gly Asn Gin lie Met Ser Asp Lys Arg Asn Val 
580 585 590 

lie Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu 
595 600 605 

Asn lie Gin Arg Phe Leu Pro Asn Pro Ala Gly Val Gin Leu Glu Asp 
610 615 620 

Pro Glu Phe Gin Ala Ser Asn lie Met His Ser lie Asn Gly Tyr Val 
625 630 635 640 

Phe Asp Ser Leu Gin Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp 

645 650 655 

Tyr lie Leu Ser lie Gly Ala Gin Thr Asp Phe Leu Ser Val Phe Phe 
660 665 670 

Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr 
675 680 685 

Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro 
690 695 700 

Gly Leu Trp lie Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly 
705 710 715 720 

Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp 

725 730 735 

Tyr Tyr Glu Asp Ser Tyr Glu Asp lie Ser Ala Tyr Leu Leu Ser Lys 
740 745 750 

Asn Asn Ala lie Glu Pro Arg Ser Phe Ser Gin Asn Pro Pro Val Leu 
755 760 765 

Lys Arg His Gin Arg Glu lie Thr Arg Thr Thr Leu Gin Ser Asp Gin 
770 775 780 

Glu Glu lie Asp Tyr Asp Asp Thr lie Ser Val Glu Met Lys Lys Glu 
785 790 795 800 

Asp Phe Asp lie Tyr Asp Glu Asp Glu Asn Gin Ser Pro Arg Ser Phe 

805 810 815 



Gin Lys Lys Thr Arg His Tyr Phe lie Ala Ala Val Glu Arg Leu Trp 
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820 825 830 

Asp Tyr Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gin 
835 840 845 

Ser Gly Ser Val Pro Gin Phe Lys Lys Val Val Phe Gin Glu Phe Thr 
850 855 860 

Asp Gly Ser Phe Thr Gin Pro Leu Tyr Arg Gly Glu Leu Asn Glu His 
865 870 875 880 

Leu Gly Leu Leu Gly Pro Tyr lie Arg Ala Glu Val Glu Asp Asn lie 

885 890 895 

Met Val Thr Phe Arg Asn Gin Ala Ser Arg Pro Tyr Ser Phe Tyr Ser 
900 905 910 

Ser Leu lie Ser Tyr Glu Glu Asp Gin Arg Gin Gly Ala Glu Pro Arg 
915 920 925 

Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val 
930 935 940 

Gin His His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp 
945 950 955 960 

Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu 

965 970 975 

lie Gly Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His 
980 985 990 

Gly Arg Gin Val Thr Val Gin Glu Phe Ala Leu Phe Phe Thr He Phe 
995 1000 1005 

Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys 
1010 1015 1020 

Arg Ala Pro Cys Asn He Gin Met Glu Asp Pro Thr Phe Lys Glu Asn 
1025 1030 1035 1040 

Tyr Arg Phe His Ala He Asn Gly Tyr He Met Asp Thr Leu Pro Gly 

1045 1050 1055 

Leu Val Met Ala Gin Asp Gin Arg He Arg Trp Tyr Leu Leu Ser Met 
1060 1065 1070 

Gly Ser Asn Glu Asn He His Ser He His Phe Ser Gly His Val Phe 
1075 1080 1085 

Thr Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr 
1090 1095 1100 

Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly He 
1105 1110 1115 1120 

Trp Arg Val Glu Cys Leu He Gly Glu His Leu His Ala Gly Met Ser 

1125 1130 1135 



Thr Leu Phe Leu Val Tyr Ser Asn Lys Cys Gin Thr Pro Leu Gly Met 
1140 1145 1150 
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Ala Ser Gly His lie Arg Asp Phe Gin He Thr Ala Ser Gly Gin Tyr 
1155 1160 1165 

Gly Gin Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser He 
1170 1175 1180 

Asn Ala Trp Ser Thr Lys Glu Pro Phe Ser Trp He Lys Val Asp Leu 
1185 1190 1195 1200 

Leu Ala Pro Met He He His Gly He Lys Thr Gin Gly Ala Arg Gin 

1205 1210 1215 

Lys Phe Ser Ser Leu Tyr He Ser Gin Phe He He Met Tyr Ser Leu 
1220 1225 1230 

Asp Gly Lys Lys Trp Gin Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu 
1235 1240 1245 

Met Val Phe Phe Gly Asn Val Asp Ser Ser Gly He Lys His Asn He 
1250 1255 1260 

Phe Asn Pro Pro He lie Ala Arg Tyr He Arg Leu His Pro Thr His 
1265 1270 1275 1280 

Tyr Ser He Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu 

1285 1290 1295 

Asn Ser Cys Ser Met Pro Leu Gly Met Glu Ser Lys Ala He Ser Asp 
1300 1305 1310 

Ala Gin He Thr Ala Ser Ser Tyr Pfte Thr Asn Met Phe Ala Thr Trp 
1315 1320 1325 

Ser Pro Ser Lys Ala Arg Leu His Leu Gin Gly Arg Ser Asn Ala Trp 
1330 1335 1340 

Arg Pro Gin Val Asn Asn Pro Lys Glu Trp Leu Gin Val Asp Phe Gin 
1345 1350 1355 1360 

Lys Thr Met Lys Val Thr Gly Val Thr Thr Gin Gly Val Lys Ser Leu 

1365 1370 1375 

Leu Thr Ser Met Tyr Val Lys Glu Phe Leu He Ser Ser Ser Gin Asp 
1380 1385 1390 

Gly His Gin Trp Thr Leu Phe Phe Gin Asn Gly Lys Val Lys Val Phe 
1395 1400 1405 

Gin Gly Asn Gin Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro 
1410 1415 1420 

Pro Leu Leu Thr Arg Tyr Leu Arg He His Pro Gin Ser Trp Val His 
1425 1430 1435 1440 

Gin He Ala Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gin Asp Leu 

1445 1450 1455 



Tyr 
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INFORMATION FOR SEQ ID NO : 4 8 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acida 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Arg Gly Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr 
15 10 15 

Gly Asp Tyr Tyr Glu Asp Ser Tyr Glu Asp lie Ser Ala Tyr Leu Leu 
20 25 30 

Ser Lys Asn Asn Ala lie Glu Pro Arg Ser Phe Ser Gin Asn Ser Arg 
35 40 45 

His Pro Ser Thr Arg Gin Lys Gin Phe Asn Ala Thr Pro Pro Thr Pro 
50 55 60 

Pro Thr Pro Pro Val Leu Lys Arg His Gin Arg Glu lie Thr Arg Thr 
65 70 75 80 

Thr Leu Gin Ser Asp Gin Glu Glu lie Asp Tyr Asp Asp Thr lie Ser 

85 90 95 

Val Glu Met Lys 
100 

(2) INFORMATION FOR SEQ ID NO: 49: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 base pairs! 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear i 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
AGAGGCATGA CCGCCTTACT GAAGGTTTCT AGTTGTGACA AGAACACTGG TGATTATTAC 60 
GAGGACAGTT ATGAAGATAT TTCAGCATAC TTGCTGAGTA AAAACAATGC CATTGAACCA 12 0 

AGAAGCTTCT CCCAGAATTC TAGACACCCT AGCACTAGGC AAAAGCAATT TAATGCCACC 180 
CCTCCTACAC CACCAACCCC ACCAGTACTG AAACGCCATC AACGGGAAAT AACTCGTACT 240 
ACTCTTCAGT CTGATCAAGA GGAAATTGAC TATGATGATA CCATATCAGT TGAAATGAAG 3 00 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: ! 

(A) LENGTH: 27 amino acid^; 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ !lD NO : 5 0 : 

Ser Arg His Pro Ser Thr Arg Glp. Lys Gin Phe Asn Ala Thr Pro Pro 
15 10 15 



Val Leu Lys Arg His Gin Arg Glu He Thr Arg 
20 25 
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(2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 1 : 
TCGCGACACC CTAGCACTAG GCAAAAGCAA TTTAATGCCA CCCCACCAGT CCTGAAACGC 6 0 

CATCAACGGG AAATAACGCG T 81 



(2) INFORMATION FOR SEQ ID NO: 52: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
ACTACTCTTC AATCTGATCA AGAGGAA 2 7 



(2) INFORMATION FOR SEQ ID NO: 53: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID |NI0:53: 
CGCGCCGCTC GAGTCTACAA TGGCTTTGCC TTTTGCTTTA CTG 4 3 



(2) INFORMATION FOR SEQ ID NO: 54: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs ! 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single , 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID;NO:54: 
GCGCCCATCG ATTTATTCCT TCCTCCTTAA CCTTfTCTTGC AAG 43 

(2) INFORMATION FOR SEQ ID NO: 55: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CGCGCCGCTC GAGCATCCCA ATGGCCCTGT CCTTTTCTTT ACTGATGG 4 8 

(2) INFORMATION FOR SEQ ID NO: 56: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid ' 

(C) STRANDEDNESS: singly 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
CCATCGATTC AATCCTTCCT CCTTAATCTT TTTTGCAAG 3 9 
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(2) INFORMATION FOR SEQ ID NO: 57: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
CCGGATCCTC TACAATGGCT TTGCCTTTTG CTTTACTG 3 8 



(2) INFORMATION FOR SEQ ID NO: 58: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 8 : 
CGCGCCGGCG GCCGCTTATT CCTTCCTCCT TAACCTTTCT TGCAAG 4 6 



(2) INFORMATION FOR SEQ ID NO: 59: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
CCGGATCCCA TCCCAATGGC CCTGTCCTTT TCTTTACTGA TGG 43 

(2) INFORMATION FOR SEQ ID NO: 60: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single / 

(D) TOPOLOGY: linear ; 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
CGCGCCGGCG GCCGCTCAAT CCTTCCTCCT T^ATCTTTTT TGCAAG 46 

(2) INFORMATION FOR SEQ ID NO: 61:/ 
(i) SEQUENCE CHARACTERISTICS./: 

(A) LENGTH: 9 0 base pairs 

(B) TYPE: nucleic acid/ 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear / 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: /SEQ ID NO: 61: 
AGCTTGCTGT TTGTGTGCTG CCTCTGAA^T CCACACTGAA CAAACTTCAG CCTACTCATG 6 0 

TCCCTAAAAT GGGCAAACAT TGCAAGCAfeC 9 0 

(2) INFORMATION FOR SEQ ID NO: 62: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
AAACAGCAAA CACACAGCCC TCCQTGCCTG CTGACCTTGG AGCTGGGGCA GAGGTCAGAG 6 0 

ACCTCTCTGA 7 0 
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{2} INFORMATION FOR SEQ ID NO: 63: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NQ:63: 
AGCTTCAGAG AGGTCTCTGA CCTCTGCCCC AGCTCCAAGG TCAGCAGGCA GGGAGGGCTG 6 0 

TGTGTTTGCT GTTTGCTGCT TG 82 



(2) INFORMATION FOR SEQ ID NO: 64: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
CAATGTTTGC CCATTTTAGG GACATGAGTA GGCTGAAGTT TGTTCAGTGT GGACTTCAGA 60 
GGCAGCACAC AAACAGCA 7 8 



(2) INFORMATION FOR SEQ ID NO: 65: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
AATTCGCTGT TTGTGTGCTG CCTCTGAAGT CCACACTGAA CAAACTTCAG CCTACTCATG 60 
TCCCTAAAAT GGGCAAACAT TGCAAGCAGC 9 0 



(2) INFORMATION FOR SEQ ID NO: 66: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
AAACAGCAAA CACACAGCCC TCCCTGCCTG C^GACCTTGG AGCTGGGGCA GAGGTCAGAG 60 
ACCTCTCTGG 7 0 



(2) INFORMATION FOR SEQ ID NO: 67: 
(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
AATTCCAGAG AGGTCTCTGA CCTCTGCCCC AGCTCCAAGG TCAGCAGGCA GGGAGGGCTG 60 
TGTGTTTGCT GTTTGCTGCT TG 82 
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(2) INFORMATION FOR SEQ ID NO: 68: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
CAATGTTTGC CCATTTTAGG GACATGAGTA GGCTGAAGTT TGTTCAGTGT GGACTTCAGA 6 0 

GGCAGCACAC AAACAGCG 7 8 



(2) INFORMATION FOR SEQ ID NO: 69: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
CGCGCCGCCC GGGGTAGATC TTGCTACCAG TGG 3 3 



(2) INFORMATION FOR SEQ ID NO: 70: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
GCGCCCGCGG CCGCCACTGT CCCAGGTCAG TGGTGGTGCC 4 0 



(2) INFORMATION FOR SEQ ID NO: 71: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear i 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 1 : 
CGCGCCGGCG GCCGCTCTAC AATGGCCTTG ACCTTTGCTT TACTGG 4 6 

(2) INFORMATION FOR SEQ ID NO: 72: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
GCGCCCATCG ATTCATTCCT TACTTCTTAA ACTTTCTTGC AAG 4 3 



(2) INFORMATION FOR SEQ ID NO: 73: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
CGCGCCGGCG GCCGCTCTAC AATGGCCTTG ACCTTTGCTT TACTGG 4 6 
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(2) INFORMATION FOR SEQ ID NO: 74: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
GCGCCCATCG ATTCATTCCT TACTTCTTAA ACTTTCTTGC AAG 4 3 




(2) INFORMATION FOR SEQ ID NO: 75: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
Asn Ser Arg His Pro Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 76: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Gin Asn Pro Pro Val Leu Lys Arg His Gin Arg Glu lie Thr 
15 10 



(2) INFORMATION FOR SEQ ID NO: 77: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear / 
(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
AATTCGCGAC ACCCTAGC 18 



(2) INFORMATION FOR SEQ ID NO: 78: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
CAAAACCCAC CAGTCTTGAA ACGCCATCAA CGGGAAATAA CG 42 



(2) INFORMATION FOR SEQ ID NO: 79: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
GCGCTGTGGG ATCGGTTTTG GGTGGTCAGA AC 3 2 
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(2) INFORMATION FOR SEQ ID NO: 80: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
TTTGCGGTAG TTGCCCTTTA TTGC 2 4 

(2) INFORMATION FOR SEQ ID NO: 81: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
Arg Thr Leu Gin Ser Asp 
1 5 

(1!) INFORMATION FOR SEQ ID NO: 82: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
CGTACTCTTC AGTCT 15 

(2) INFORMATION FOR SEQ ID NO: 83: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic.) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GCATGAGAAG TC AG AC TAG 19 



